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TECHNICAL FIELD 

The present invention relates to production of alkaloids from poppy plants and in 
particular to genes encoding enzymes in the alkaloid pathway, to proteins encoded by the 
genes and to methods of increasing alkaloid content of poppy plants. 
5 INTRODUCTION 

The opium poppy Papaver somniferum is grown under strict government control, 
for the production of medically useful alkaloids such as morphine and codeine. The 
alkaloid content of poppy straw (includes threshed poppy capsules) is the most important 
parameter in the efficiency of opium alkaloid production. There have been numerous 
10 attempts to increase the yield of alkaloid per ton of poppy material. The vast majority of 
approaches focus on improving agricultural practices and on established methods of 
conventional breeding in the attempt to increase cultivation efficiencies and to 
modifying the genotype of the opium poppy plants. 

In addition to increasing the overall yield of opium alkaloids, the relative content 
15 of particular alkaloids in the poppy plants is also of considerable importance and has an 
impact on efficiency of processing of the plant material and the ultimate yield and cost 
of an alkaloid. 

Usually only one of the many alkaloids that can be produced by a poppy plant is 
found as the predominant alkaloid. In the opium poppy this is predominantly morphine 
20 which accumulates after flowering of the plant. However, before flowering thebaine is 
most abundant. The reason for such bias can be explained, at least in part, by analysis of 
what is currently known about alkaloid metabolism in the opium poppy and its 
regulation. 

The network of reactions, enzymes, co-factors and metabolic intermediates 
25 leading to the synthesis of alkaloids in the opium poppy constitute a complex metabolic 
pathway which is regulated at numerous points. There are also thought to be a number 
of rate limiting steps ("bottlenecks") where limitations in the availability of either 
substrates, co-factors or certain enzymes, determine which particular branch of the 
synthetic pathway is favoured and therefore the ultimate "mix" of alkaloids and the type 
30 of alkaloid which is predominantly in the plant. A class of enzymes known as 
cytochrome P-450 are known to be involved in the synthesis of several intermediates in 
the .pathway. However, unlike the enzymology of mammalian cytochrome P450 
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enzymes, similar plant en2ymes are considerably less abundant (Biochimie 1987, 
69:743-752) and have been less clearly described. It is known that plant P450 enzymes 
are like mammalian proteins and that they are hemoproteins which have a common 
prosthetic group containing iron and are membrane-bound proteins found within the 
5 endoplasmic reticulum. Generally, the P450-dependent enzymes catalyse the transferral 
of oxygen to the substrate and effectively remove one of the atoms from an oxygen 
molecule and are also referred to as monooxygenases. The reactions are dependent on a 
range of co-factors including NADPH and a second enzyme P450 cytochrome reductase. 
More particularly, in the biosynthesis of alkaloids in plants, cytochrome P-450- 

10 dependent oxidases and monooxygenases have been shown to catalyse highly regio- and 
stereoselective reactions. Hydroxylases and oxidases specific to alkaloid biosynthesis 
have been identified and characterised for the protopine, berberine, 
bisbenzylisoquinoline, benzophenanthridine, morphinan and monoterpenoid indole 
alkaloid biosynthetic pathways. The role of cytochrome P-450s in alkaloid biosynthesis 

15 is exemplified by the biosynthesis of sanguinarine in Eschscholzia californica 
(California poppy). Of the six oxidative transformations involved in the conversion of 
(iS)-reticuline to sanguinarine, four are thought to be catalysed by cytochrome P-450- 
dependent enzymes. 

Thus, plant cytochrome P-450-dependent enzymes, including those from the 
20 alkaloid poppy, constitute a substrate-specific class of enzymes that differs from their 
mammalian counterpart in the high regio- and stereospecificity as well as in the novel 
nature of the reactions catalysed. The enzymes involved in the alkaloid biosynthetic 
pathway require among other things the presence of a cytochrome P-450 reductase 
enzyme. Plant cytochrome P-450 reductases have been purified or enriched from C 
25 roseus (1,8) sweet potato (9), Helianthus tuberosus (Jerusalem artichoke) (10), Glycine 
max (soybean) cell suspension cultures (11), Pueraria lobata (12) and petunia flowers 
(13). cDNA encoding cytochrome P-450 reductase has been isolated from Vigna radiata 
(mung bean) (14), C roseus (15), K tuberosous "(accession Z26250, Z26251), Vicia 
sativa (accession Z26252) and Arabidopsis (16). cDNA cloning and heterologous 
30 expression in E. coli of the G roseus cytochrome P-450 reductase has been reported 
(15). 
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The reductase is responsible for providing electrons to the P450 and is thought to 
be a relatively promiscuous enzyme in that a particular reductase species will reduce a 
range of distinct P450s. It is also known that the cytochrome P450 enzymes are in molar 
excess to the level of P450 reductase. This imbalance may be a regulatory step for the 
5 reduction and therefore be rate-limiting of the cytochrome P450 activity. Although there 
is some promiscuity within species, available data suggests that there is poor 
transferability of reductases from diverged species. For example, although cytochrome 
P-450 reductase from insect cell culture and porcine liver was shown to transfer 
electrons to heterologously expressed Berberis berbamunine synthase, the highest 

1 0 turnover number was achieved with the Berberis reductase (7). 

Notwithstanding this body of work, to date it has not been possible to establish 
the exact nature of the "bottlenecks" in the alkaloid metabolism pathway or to identify 
the key enzymes which may be responsible and which could be used to manipulate 
alkaloid metabolism in the opium poppy in order to achieve higher yields of alkaloids 

1 5 generally, and specific alkaloids in particular. 

As the cost of producing poppy alkaloids is very dependent on the alkaloid 
content of poppy straw, it would be a major advantage if high alkaloid containing straw 
could be obtained rather than to attempt to increase the yield of straw. In fact, it is 
possible that any increase in the yield of straw may result in the relative content of 

20 alkaloid decreasing through dilution. High alkaloid-containing straw would provide 
efficiencies throughout the CPS ("Concentrate of Poppy Straw") production process. If 
high crop yields can be achieved, either less hectares of crop need be grown or the pre- 
existing areas can be used to increase production. High yield crops would also reduce 
the cost of harvest, transport, drying, storage, processing and waste disposal per unit 

25 weight of alkaloid. Thus, to increase the yield of an alkaloid it would be most efficient 
to manipulate the plants to increase alkaloid content of the straw rather than to increase 
the yield of straw. 

SUMMARY OF THE INVENTION 

It has now been found that among the rate-limiting steps in the production of 
30 morphine in Papaver somniferwn are the steps which depend on the cytochrome P-450 
enzymes, and therefore in turn on the cytochrome P-450 reductases. This observation 
has Jed to the identification and isolation of cytochrome P-450 reductase enzymes in the 
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alkaloid poppy, the isolation and characterisation of polynucleotides encoding the 
reductase enzymes, the expression of the polynucleotides encoding the reductases in 
eukaryotic and prokaryotic expression systems, including plant cells and transfected or 
transformed plants. The identification of cytochrome P-450 reductase genes and their 
5 products in poppy plants now enables methods of controlling the total alkaloid content of 
a plant, the ultimate "mix" of alkaloids as well as the type of predominant alkaloid 
synthesised by the plant. This can be achieved by alleviating the "bottlenecks" in the 
pathway through overexpression of the relevant reductase genes in plants transformed or 
transfected with a nucleotide sequence encoding an appropriate P-450 reductase enzyme. 

10 Thus, according to a first aspect there is provided an isolated and purified 

polynucleotide encoding a cytochrome P-450 reductase enzyme from an alkaloid poppy 
plant, or a variant or fragment thereof v 

The polynucleotide may be selected from the group consisting of genomic DNA 
(gDNA), cDNA, or synthetic DNA. The preferred polynucleotides encoding a 

15 cytochrome P-450 reductase are selected from those shown in Figures 9a and 9b or 
fragments thereof. It will be understood however that sequences shown in the Figures 9a 
and 9b may be expressed in the absence of the native leader sequences or any of the 5' or 
3' untranslated regions of the polynucleotide. Such regions of the polynucleotide may 
be replaced with exogenous control/regulatory sequences in order to optimise/enhance 

20 expression of the sequence in an expression system. Figures 10a and 10b represent 
examples of truncated polynucleotide sequences encoding cytochrome P-450 reductases 
of P. somniferum and E. californica respectively, wherein the native leader sequences 
have been removed in order to enhance the expression of the enzyme. 

The preferred alkaloid-producing poppy plants are Eschscholzia californica and 

25 Papaver somniferum. 

It will also be understood that analogues and variants of the polynucleotide 
encoding a cytochrome P-450 reductase from alkaloid poppy plants fall within the scope 
of the present invention. Such variants will still encode an enzyme with cytochrome P- 
450 reductase properties and may include codon substitutions or modifications which do 

30 not alter the amino acid encoded by the codon but which enable efficient expression of 
the polynucleotide encoding cytochrome P-450 reductase enzyme in a chose expression 
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system. Other variants may be naturally occurring, for example allelic variants or 
isoforms. 

According to a second aspect there is provided an isolated and purified 
polynucleotide which binds under stringent conditions to the nucleotide sequence 
5 encoding a cytochrome P-450 reductase enzyme from an alkaloid poppy plant, or a 
fragment thereof. 

Such complementary polynucleotides are useful in the present invention as 
probes and primers, as antisense agents or may be used in the design of other suppressive 
agents such as ribozymes and the like. 

10 According to a third aspect there is provided an isolated and purified 

polynucleotide which codes for prokaryotic or eukaryotic expression of a cytochrome P- 
450 reductase enzyme from an alkaloid poppy plant, or a variant or fragment thereof, 
wherein the polynucleotide is expressed in an environment selected from the group 
consisting of the extracellular environment, an intracellular membranous compartment, 

1 5 intracellular cytoplasmic compartment or combinations thereof. 

The polynucleotide encoding a cytochrome P-450 reductase may be coupled to 
another nucleotide sequence which would assist or directing the expression of the 
reductase with respect to a particular cellular compartment or the extracellular 
environment. 

20 According to a fourth aspect there is provided an isolated and purified 

cytochrome P-450 reductase enzyme from an alkaloid poppy plant, being a product of 
prokaryotic or eukaryotic expression of the polynucleotide of the first aspect or the 
second aspect. 

Variants of the cytochrome P-450 reductase enzyme which incorporate amino 
25 acid deletions, substitutions, additions or combinations thereof, are also contemplated. 
The variants can be advantageously prepared by introducing appropriate codon 
mutations, deletions, insertions or combinations thereof, into the polynucleotide 
encoding the P-450 reductase enzyme. Such variants will retain the properties of the P- 
450 reductase enzyme, either in vivo or in vitro. Other variants may be naturally 
30 occurring, for example allelic variants or isoforms. 

The cytochrome P-450 reductase may be expressed in and by a variety of 
eukaryotic and prokaryotic cells and organisms., including bacterial-yeasts, insect cells, 
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mammalian and other vertebrate cells, or plant cells. Preferably the expression system is 

a plant expression system and even more preferred is an alkaloid poppy plant. Suitable 

alkaloid poppy plants are Eschscholzia californica and Papaver somniferum. 

For expression of cytochrome P-450 reductase activity, a fragment of the 
5 polynucleotide encoding a cytochrome P-450 reductase may be employed, such fragment 

encodes functionally relevant regions, motifs or domains of the reductase protein. 

Similarly, fragments of the P-450 reductase enzyme resulting from the recombinant 

expression of the polynucleotide may be used. Functionally important domains of 

cytochrome P-450 reductase may be represented by individual exons or may be 
10 identified as being highly conserved regions of the protein molecule. Those parts of the 

cytochrome P-450 reductase which are not highly conserved may have important 

functional properties in a particular expression system. 

According to a fifth aspect there is provided a method for preparing plants which 

overexpress a cytochrome P-450 reductase enzyme, comprising transfecting or 
1 5 transforming a plant cell, a plant part or a plant with the polynucleotide according to the 

first aspect or the second aspect. 

Preferably, the plants overexpressing the P-450 reductase are Eschscholzia 

californica and Papaver somniferum. Suitable promoters to control the expression of the 

P-450 reductase gene may be derived from for example cauliflower mosaic virus or 
20 subterranean clover mosaic virus. Other virus promoters may also be suitable. Further, 

the use of the endogenous promoter may also be appropriate in certain circumstances. 

Such a promoter may be co-isolated with the gDNA encoding the P-450 reductase 

enzyme. 

According to a sixth aspect there is provided a cell transformed or transfected 
25 with a polynucleotide of the first aspect or the second aspect. 

Cells which may be transfected or transformed with a polynucleotide encoding a 
cytochrome P-450 reductase are bacterial, yeast, animal or plant cells. For preference 
the cells are plant cells. Even more preferred are cells from an alkaloid poppy plant, 
such as Eschscholzia californica or Papaver somniferum. 
30 According to a seventh aspect there is provided a plant transformed or transfected 

with a polynucleotide according to the first aspect or the second aspect, wherein the plant 
exhibits modified expression of the cytochrome P-450 reductase enzyme. 
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For preference, the modified expression manifests itself in overexpression of the 
cytochrome P-450 reductase enzyme. However, reduced expression of cytochrome P- 
450 reductase can also be achieved if the plant is transformed or transfected with a 
polynucleotide which is complementary to the polynucleotide encoding the reductase. 
5 According to a eighth aspect there is provided an alkaloid poppy plant 

transformed or transfected with a polynucleotide according to the first aspect or the 
second aspect, wherein the plant has a higher or different alkaloid content when 
compared to a plant which has not been so transformed or transfected. 

Preferably the plants expressing higher alkaloid content are selected from 
10 Eschscholzia californica and Papaver somniferum. Even more preferred is Papaver 
somniferum. 

According to a ninth aspect there is provided a recombinant DNA molecule 
comprising the polynucleotide according to the first aspect or the second aspect. 

Preferably the recombinant DNA molecule is a viral or plasmid vector. Such a 
1 5 vector may direct prokaryotic or eukaryotic expression of the polynucleotide encoding a 
cytochrome P-450 reductase or it may prevent or reduce its expression. 

According to a tenth aspect there is provided a method of altering the yield or 
nature of alkaloid in a plant comprising transforming or transfecting a plant with a 
polynucleotide encoding a cytochrome P-450 reductase enzyme or a variant or fragment 
20 thereof, or with a polynucleotide which binds under stringent conditions to the 
polynucleotide encoding said enzyme. 

According to a eleventh aspect there is provided a method of increasing the yield 
of alkaloid in a plant comprising transforming or transfecting a plant with a 
polynucleotide encoding a cytochrome P-450 reductase enzyme or a variant or fragment 
25 thereof, wherein the enzyme is overexpressed in said plant. 

According to a twelfth aspect there is provided a method of altering type or blend 
of alkaloid in a plant comprising transforming or transfecting a plant with a with a 
polynucleotide encoding a cytochrome P-450 reductase enzyme or a variant or fragment 
thereof, or with a polynucleotide which binds under stringent conditions to the 
30 polynucleotide encoding said enzyme. 
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For preference the alkaloid is morphine, codeine, oripavine or thebaine, but it 
will be understood that other intermediates in the alkaloid metabolic pathway are also 
within the scope of the present invention. 

BRIEF DESCRIPTION OF FIGURES 

5 Figure. 1. SDS-PAGE analysis of fractions from the purification of cytochrome P- 
450 reductase from P. somniferum cell suspension cultures. Protein bands were 
visualized by silver staining. Lane 1, protein standards; lane 2, affinity chromatography 
elution buffer without protein: lane 3, 1 jig protein from the 2',5'-ADP Sepharose 4B 
eluate after dialysis; lane 4, 4 jig microsomal protein; lanes 5,6, 4 |ig solubilized 

10 microsomal protein; lane 7, 4 \x% protein from the DEAE cellulose eluate. 

Figure 2. Amino acid sequences of seven endoproteinase Lys-C-generated peptides 
of the cytochrome P-450 reductase from P. somniferum cell suspension cultures. 
Figure 3. Partial amino acid sequence comparison of plant cytochrome P-450 
reductases. The shaded areas and arrows indicate the position and direction of the 

1 5 regions used for PCR oligodeoxynucleotide primer design. 

Figure 4. Genomic DNA gel blot analysis of (A) P. somniferum hybridized to the 
P. somniferum full-length cDNA and (B) E.californica hybridzed to the E.californica 
full-length cDNA and to (C) the 288 bp PCR fragent corresponding to the second 
isoform. The numbers following the restriction enzyme names indicate the number of 

20 recognition sites that occur in the reading frame. For the second E.californica isoform, 
this is known only over a 288 bp region. 

Figure 5. Comparison of the amino acid sequences of the cytochrome P-450 
reductase from P. somniferum and from E. californica. Top sequence, E. californica; 
bottom sequence, P. somniferum; *, amino acid identity. 
25 Figure. 6. Nucleotide sequences of cDNA from (a) P. somniferum, and (b) E. 
californica. 

Figure 7. Functional expression of cytochrome P-450 reductases in yeast and insect 

cell culture. (A) Expression of pYES2/PsoCPRI ( ), pRS405/PsoCPRII 

( ), pYES2/PsoCP ( ), control ( ); (B) pFastBac/PsoCPRII 

30 ( ), control ( ); (C) P YES2/EcaCPRII ( ), P RS405/EcaCPRII 
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( ), control ( ); (D) pFastBac/EcaCPRII ( ), control 

( ). Pso CPR, P. somniferum cytochrome P-450 reductase; Eca CPR, E. 

californica cytochrome P-450 reductase; Sf9, S.frugiperda Sf9 cell culture. 
Figure 8. Restriction enzyme map (unique sites) for cDNA sequences of (a) P. 
5 somniferum, and (b) E. californica. 

Figure 9. Amino acid sequences of (a) P. somniferum, and (b) E. californica, 
predicted from their respective cDNA nucleotide sequences. The start and stop codons 
are depicted in bold. 

Figure 10. cDNA nucleotide sequences and their predicted amino acid sequences, of 
10 fragments subcloned into an expression vector: (a) P. somniferum, and (b) E. 
californica. Both sequences are truncated versions of sequences represented in Figures 
9a and 9b, lacking the leader sequences. Extra vector sequences/restriction sites derived 
during subcloning are shown in lowercase and the cDNA in uppercase. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

1 5 The details of the metabolic pathway leading to synthesis of opium alkaloids in 

the opium poppy, Papaver somniferum, part of which is depicted in Scheme I. 
Typically, the P450 enzyme exists in a 15 - 20 fold excess as compared to the reductase 
level and as there is approximately a 6:1 dependence between the two enzymes, it is 
feasible that the reductase levels are limiting the rate of the cytochrome P450 enzyme. 

20 By supplying plant tissue with radiolabeled compounds and following the accumulation 
of radioactivity in the various intermediates in the pathway it was shown that addition of 
radiolabel (labeled reticuline, salutaridinol) before thebaine results in accumulation of 
radioactivity at thebaine. Addition of radiolabeled compounds after thebaine result in 
the accumulation of radioactivity at codeine. 

25 Oripavine is an intermediate from a second route of conversion from thebaine to 

morphine. It is thought that thebaine is converted to oripivine by the same 3 
demethylase that converts codeine to morphinone. The slow modification of the isotopic 
oripavine is probably due to the rate limitation of the 6 demethylase. 

With the assistance of such experiments it has now been found that among the 

30 rate-limiting steps in the production of morphine in Papaver somniferum are the steps 
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which depend on the reduction of cytochrome P-450 by the cytochrome P-450 
reductases. 

Thus the following steps are known or suspected to be catalysed by P-450 
enzymes which are rate limiting: 
5 1 (R)-reticuline -> salutaridine 

2 thebaine — » neopinone — ► codeinone 

3 codeine -> morphine 
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SCHEME I 
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The importance of cytochrome P-450 in alkaloid biosynthesis is also exemplified 
by the biosynthesis of sanguinarine in Eschscholzia californica (California poppy). This 
biosynthetic pathway is shown in Scheme II. 
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The present invention provides by way of example the sequence of the P-450 
reductase enzyme genes from two poppy species, the opium poppy Papaver somniferum 
and the Californian poppy Eschscholzia californica. The sequence information has been 
shown to code for the enzymes by expression in a heterologous expression system 
5 followed by biochemical characterisation. These studies have also shown that the 
alteration of the ratio and species of reductase will significantly alter the interaction of 
the cytochrome P-450 with its substrate suggesting a lack of tolerance for general 
interchange of reductase genes. 

The over-expression of the P-450 reductase gene in an alkaloid producing pant 
10 will alleviate the rate-limitation of the P450 by increasing the rate of reduction of the 
active P-450 enzyme. In brief, controlling the reductase should control the P-450 
cytochrome. 

The information on the protein coding region of cytochrome P-450 reductase 
enzymes may be applied to increase yields of alkaloids in the poppy plant as follows: 
15 1) obtain the gDNA or cDNA sequence of the gene from the target plant and a 
closely related plant. 

2) sub-clone the gDNA or cDNA into a plasmid vector that contains the following: 

• a promoter suitable for overexpression of the cDNA in poppy, for example a 
promoter derived from the cauliflower mosaic virus or the subterranean clover 

20 mosaic virus. 

• a selectable marker linked to a different promoter to facilitate the selection of 
transformants. Marker could be a dominant marker such as a herbicide resistance 
gene or an antibiotic resistance gene. 

• suitable selectable markers and replication origins for maintenance of the 
25 plasmid in bacteria 

• suitable sequences to facilitate mobilisation of the plasmid by Agrobacterium 
tumefacians-med'xaled transformation. 

3) transform a suitable strain of A. tumefacians and then co-cultivate the bacteria 
with suitable samples of plant tissue such as callus, embryonic tissue or hypocotyl tissue. 

30 4) place treated tissue on selectable media and provide appropriate media to 
promote differentiation and plant re-generation. 
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5) characterise candidate plants by Southern and Northern blotting to confirm 
integration of gene and expression in appropriate tissues 

6) self-pollinate transformed plants, analyse segregants to identify hemizygotes and 
homozygotes 

5 7) analyse biochemistry of transgenic plants. 

Isotope labelling can be used to identify bottlenecks and HPLC analysis will 
determine levels of alkaloids. 

In order to develop optimised convenient heterologous expression systems for the 
cytochrome P-450-dependent oxidases of select isoquinoline alkaloid-producing plant 
10 species, facile PCR-based method have been developed with which to clone cytochrome 
P-450 reductase and express the enzyme in yeast and insect cell culture as initial 
expression systems. 

The invention will now be described with reference to specific examples. 

EXAMPLES 

15 Example 1: Enzyme purification and amino acid sequencing: 

Plant cell cultures . Cell suspension cultures of P. somniferum and E. californica 
were routinely grown in 1 -litre conical flasks containing 400 ml of Linsmaier-Skoog 
medium (17) over 7 days at 23 °C on a gyratory shaker (100 rpm) in diffuse light (750 
lux). Elicitation of E. californica cell suspension cultures was achieved by the asceptic 

20 addition of methyl jasmonate to a final concentration of 100 ^iM to the medium (18). 

Purification and sequence analysis . Cells were harvested from seven-day-old 
suspension cultures of P, somniferum by vacuum filtration, immediately shock frozen 
and stored at -20°C. All of the following operations were carried out at 4°C. 500 g 
frozen tissue were then homogenized with a mortar and pestle in 1 litre 0.1 M 

25 tricine/NaOH, pH 7.5 containing 15 mM thioglycolic acid. Cell debris was removed by 
centrifugation at 10,000 x g, 30 min. The supernatant was filtered through four layers of 
cheesecloth and the microsomes were then isolated by MgCl 2 precipitation according to 
(19). In a typical preparation, 500 g fresh weight of cells yielded 8-10 mg/ml 
microsomal protein. Microsomal protein was solubilized as follows. 2 mg CHAPS (3- 

30 [(3-cholamidopropyl)dimethyl-ammonio]-l-propane-sulfonate, Roth) per mg 
microsomal protein was prepared in 1 ml of 0.1 M tricine/NaOH, pH 7.5 containing 15 
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mM thioglycolic acid. This solution was added dropwise to the microsomal suspension. 
2% (v/v) Emulgen 91 1 (Kao Corporation) was then added and the solution slowly stirred 
for 1 h. Membrane fragments were removed by centrifugation at 105,000 x g for 60 min. 
The total activity in the solubilized microsomes was assigned the value 100%. The 
5 solubilized cytochrome P-450 reductase was then purified to electrophoretic 
homogeneity according to (20). In this manner, 50 ng cytochrome P-450 reductase was 
purified from 8 kg P. somniferum cell suspension culture in 97% yield. 

The purified enzyme preparation was subjected to SDS/PAGE to remove traces 
of Emulgen 911 and CHAPS, and the coomassie brilliant blue R-250- visualized band 

10 representing the cytochrome P-450 reductase was digested in situ with endoproteinase 
Lys- C as reported in (21). The peptide mixture thereby obtained was resolved by 
reversed phase HPLC [column, Merck Lichrospher RP18; 5 |im (4 x 125 mm); solvent 
system, (A) 0.1% trifluoroacetic acid, (B) 0.1% trifluoroacetic acid / 60% acetonitrile; 
gradient of 1% per min; flow rate of lml/min] with detection at 206 nm. The scheme for 

1 5 the purification of the cytochrome P-450 reductase is given in Table I. 

TABLE I Purification of Cytochrome P-450 Reductase from P. somniferum Cell 



Suspension Cultures 



Purification 


Total 


Total 


Specific 


Purification 


Yield (%) 


Step 


protein 


activity 


activity 


factor (fold) 






(mg) 


(nkatal) 


(nkatal/mg) 






Microsomes 


308 


59 


0.2 






Solubilized 


244 


71 


0.3 


1 


100 


microsomes 












DEAE 


21 


71 


3.4 


11 


100 


Cellulose 












2\5'-ADP 


0.05 


47 


927 


3100 


66 


Sepharose 












4B 












Dialysis 


0.05 


69 


1385 


4600 


97 



Following this facile purification procedure (20), 50 [ig of enzyme could be 
purified to near electrophoretic homogeneity from 8 kg fresh weight of cell suspension 
20 culture with minimal loss of activity. Gel electrophoretic analysis of aliquots of the 
purification steps suggest that there may be two isoforms of the cytochrome P-450 
reductase in P. somniferum as there were two protein bands .present in the 2\5'-ADP 
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Sepharose 4B eluate at 80 kDa (Fig. 1). To further test the possible presence of 
isoforms, 10 protein from the 2',5-ADP Sepharose 4B eluate was subjected to native 
polyacrylamide gel electrophoresis, the two closely migrating protein bands were eluted 
and both tested positive for cytochrome c reduction. These two isozymes could not be 
5 chromatographically resolved and were therefore characterized together. 

The purified reductase exhibited a pH optimum at 8.0 in 0.5 M Tricine buffer. 
The optimal molarity range of the Tricine buffer was determined to be 250-500 mM. At 
100 mM and at 1 M Tricine, the activity declined to 21% and 77%, respectively. The K m 
value for cytochrome c was 8.3 jiM and that for the cofactor NADPH was 4.2 \iM. The 
10 distribution of the cytochrome P-450 reductase in a 3-month-old P. somniferum plant is 
given in Table II. 

TABLE II Distribution of Cytochrome P-450 Reductase Activity in a 3-Month-Old 



P. somniferum Plant 



Plant part 

Capsule 


Specific activity 
(pkatal/g dry weight) 


Specific activity 
(pkatal/mg protein) 


2700 


660 


stem 


2000 


930 


Leaf 


840 


390 


Root 


670 





On a dry weight basis, the highest activity is present in the capsule. 

15 Microsequencing was accomplished with an Applied Biosystems model 470 gas- 

phase sequencer. The amino acid sequence of seven endoproteinase Lys-C-generated 
peptides was determined on the mixture of both isozymes (Fig. 2). A comparison of 
these amino acid sequences with those available for plant cytochrome P-450 reductases 
in the GenBank/EMBL sequence database allowed the relative positioning of the seven 

20 internal peptides due to high sequence homology. This also served as supportive 
evidence that the isozymes that were purified were indeed cytochrome P-450 reductases. 

Example 2: Generation of partial cDNAs from P. somniferum and E. californica. 

Optimized PCR primers were then designed based on highly homologous sites on 
both the amino acid and nucleotide levels in the plant cytochrome P-450 reductase 
25 sequence comparison (Fig. 3). The precise sequence of the primers used for the first 
round of PCR was: 




- la- 
s' -CA ITI CII CCT CCT TTC CC-3 1 and 
T 

3 1 -ACC TAC TTC TTA CGI CAA GG-5 1 . 

C TGC 

5 Polymerase chain reaction (PCR) generated partial cDNAs encoding cytochrome 

P-450 reductases from P. somniferum and E. calif ornica were produced by PCR using 
cDNA produced by reverse transcription of total RNA isolated from 3 to 5-day-old 
suspension cultured cells. DNA amplification was performed under the following 
conditions: 5 cycles of 94°C, 30 sec; 45°C, 1 min; 72°C, 1 min; 25 cycles of 94°C, 30 
10 sec; 55°C, 30 sec, 72°C, 1 min. At the end of 30 cycles, the reaction mixtures were 
incubated for an additional 5 min at 72°C prior to cooling to 4°C. The amplified DNA 
was then resolved by agarose gel electrophoresis, the bands of approximately the correct 
size were isolated and subcloned into pGEM-T (Promega) prior to nucleotide sequence 
determination. 

15 Resolution of this first PCR experiment by agarose gel electrophoresis revealed a 

mixture of DNA products in the expected range of 400-450 bp. The bands in this size 
range were eluted from the gel and used as template for nested PCR with the following 
primers: 

5'-CA ITI CII CCT CCT TTC CC-3' and 
20 T 

3 ' -AAA CGI CGI TAI CGI GGI GCI IGI GTT GG-5' 
G C 

The result from the nested PCR was a single DNA band with the expected size of 
288 bp. The translation of the nucleotide sequence of this PCR product indicated that it 

25 was indeed encoding a cytochrome P-450 reductase. This 288 bp PCR-generated partial 
cDNA was then used as hybridization probe to screen an amplified P. somniferum cell 
suspension culture cDNA library. In this manner, from a total of 300,000 clones 
screened, two positive clones were isolated. Of these two positive clones, one was 
determine to be full-length by a restriction endonuclease analysis. The nucleotide 

30 sequence of this full-length cDNA clone was then determined for both strands. The 
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reading frame coded for 684 amino acids corresponding to a relative molecular mass of 
77.5 kDa. 

An identical PCR-based approached was also carried out with RNA isolated from 
methyl jasmonate-induced E. californica cell suspension cultures (26). Nucleotide 
5 sequence determination of the 288 bp DNA fragment indicated that in E. californica one 
cytochrome P-450 reductase form is present. However, screening of 400,000 clones of a 
primary cDNA library prepared from RNA isolated from methyl jasmonate-induced E. 
californica cell suspension cultures resulted in the isolation of one partial and one full- 
length clone, both of which encoded a second isoform. The nucleotide sequence of this 

10 full-length cDNA clone was then determined for both strands. The reading frame 
encoded 705 amino acids that corresponded to a relative molecular mass of 78.7 kDa. 
RNA gel blot analysis indicated that this isoform gene is weakly induced two-fold by 
treatment of the cell cultures with methyl jasmonate. Genomic DNA gel blot analysis of 
each reductase indicates that one gene encodes each isoform in E. californica and that 

1 5 one gene also encodes the cloned isoform in P. somniferum (Fig. 4). 

The overall sequence homology of the cytochrome P-450 reductase from P. 
somniferum and that from E. californica is 63% identity at the nucleotide level and 69% 
identity at the amino acid level (Fig. 5). This compares to an overall sequence identity to 
other plant cytochrome P-450 reductases of approximately 50% at both the nucleotide 

20 and amino acid levels. 

Nucleotide sequence determination . The entire nucleotide sequence on both 
DNA strands of full-length cDNA clones in pBluescript was determined by dideoxy 
cycle -sequencing using internal DNA sequences for the design of deoxyoligonucleotides 
as sequencing primers. Nucleotide sequences of cDNAs of P, somniferum and E. 

25 californica are given in Figures 6A and 6B, respectively. 

Alternative approaches. cDNA can also be prepared by isolating RNA from 
either plant cell suspension cultures or from different material, according to a method 
using LiCl precipitation of ribonucleic acid as described in "Current Protocols in 
Molecular Biology" Ausubel FM 3 Brent R, Kingston RE, Moore DD, Seidman JG, 

30 Smith JA, Struhl K, eds John Wiley & Sons, Inc. New York (1987). mRNA was then 
isolated from the total RNA using either an oligo dT cellulose column or oligo dT beads 
(Oligotex beads, QIAGEN) according to the manufacturers instructions. The cDNA 
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libraries were prepared from mRNA with cDNA and lambda ZAP kits from Stratagen 
(La Jolla, California, USA), according to the manufacturers instructions. 

Example 3: cDNA isolation and heterologous expression of cytochrome P-450 
reductase in Saccharomyces cerevisiae. 
5 cDNA clones encoding the Papaver and Eschscholzia cytochrome P-450 

reductases were isolated by screening of cDNA libraries prepared in either A.-ZAP II or 
Uni-ZAP XR (Stratagene) using the partial clones generated by PCR as hybridization 
probe. The clones that yielded positive results through a third screening were converted 
to pBluescript KS (+) by excision. After determination of the nucleotide sequence on 
10 both strands, the full length reading frame, free of the 5 1 - and 3 f -flanking sequences, was 
generated by PCR using either Taq DNA polymerase (Perkin Elmer) and was subcloned 
into pGEM-T (Promega) or Pfu DNA polymerase and was subcloned into pCR-Script 
SK (+) (Stratagene). 

The P. somniferum cytochrome P-450 reductase cDNA in pGEM-T, designated 

15 pGEM-T/PsoCPR, was digested with the restriction endonucleases Not I and Hin dill 
and the 2096 bp fragment was ligated into Not \IHin dill digested pYES2 (autonomously 
replicating yeast expression vector from Invitrogen) to produce the expression plasmid 
pYES2/PsoCPRI. This particular construction had 27 bp of a noncoding region 
upstream from the AUG start codon. This was reduced to 6 bp by digestion of 

20 pYES2/PsoCPRI with Hin dill and Cla I. This 55 bp restriction fragment was then 
replaced by ligation with a synthetic DNA adaptor of a sequence that replaced the 
reading frame from the internal Cla I site through the start codon, which was 
immediately preceded by a Hin dill recognition sequence. The resulting construct was 
termed pYES2/PsoCPRIL 

25 The E. californica cytochrome P-450 reductase cDNA in pGEM-T, designated 

pGEM-T/EcaCPR, was digested with the restriction endonucleases Sal I and Not I and 
the 2289 bp fragment was ligated into Sal VNot I digested pGEM-9Zf (-) (Promega). 
pGEM-9Zf/EcaCPR was then digested with Sst I and Not I and the 2292 bp fragment 
was ligated into Sst l/Not I digested pYES2 to produce the expression plasmid 

30 pYES2/EcaCPRI. The noncoding sequences upstream of the start codon were 
minimized by digestion with Sma I and Eco ICRI and the vector recircularized by blunt- 
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end ligation (plasmid termed pYES2/EcaCPRII). These autonomously replicating 
expression plasmids were then introduced into the Saccharomyces cerevisiae strain 
INVSC1 under uracil selection. 

Expression of the vector pYES2/PsoCPRI containing 27 noncoding nucleotides 
5 upstream of the start codon resulted in increased enzyme activity that was 2.6-fold 
greater than the yeast endogenous reductase (Fig. 7A). Shortening of this noncoding 
sequence to 6 bp in vector pYES2/PsoCPRII resulted in 9-fold greater enzyme activity 
than in the control yeast strain containing only the vector pYES2. Expression using the 
integrative yeast vector pRS405 was also investigated for the eventual possible 

10 heterologous co-expression of both a plant cytochrome P-450 reductase and a plant 
oxidase in yeast. Expression of the vector pRS405/PsoCPRII, in which transcription of 
the P. somniferum cytochrome P-450 reductase was also driven by the GAL1 gene 
promoter, resulted in 67% of the enzyme activity compared to the autonomously 
replicating vector pYES2/PsoCPRII. 

15 Expression of pYES2/EcaCPRII and of pRS405/EcaCPRII resulted in a 15-fold 

and 10- fold increase in activity over the endogenous yeast reductase, respectively (Fig. 
7C). 

The P. somniferum cytochrome P-450 reductase reading frame downstream from 
the GAL 1 promoter was generated by PCR from pYES2/PsoCPRII. The 2598 bp PCR 
20 product was ligated into pCRScript and then excised by digestion with Not I and Sal I. 
This 2669 bp Not \ISal I fragment was ligated into the Not VSal I digested yeast 
integrative expression vector pRS405 (Stratagene). 

. The E, californica cytochrome P-450 reductase reading frame downstream from 
the GAL 1 promoter was introduced into the yeast integrative expression vector pRS405 
25 by digestion of pYES2/EcaCPRII with Pst I and Not I and the 2835 bp fragment was 
ligated into Pst VNot I digested vector. The integrative expression plasmids 
pRS405/PsoCPRII and pRS405/EcaCPRII were then introduced into the S. cerevisiae 
strain INVSC1 under leucine selection. 

Yeast microsomes were isolated according to either (22) or (23) and the presence 
30 cytochrome P-450 reductase was measured as the ability to reduce cytochrome c (24). 
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Example 4: Heterologous expression of cytochrome P-450 reductase in 
Spodoptera frugiperda Sf9 cells. 
The P. somniferum cytochrome P-450 reductase cDNA construct 
pYES2/PsoCPRII was digested with Hin dill and Xba I and the resultant 2096 bp 
5 fragment was ligated into Hin dlll/Xba I digested pGEM-7Zf (+) (Promega). pGEM- 
7ZffPsoCPRII was then digested with Bam HI and Xho I and the 2090 bp fragment was 
ligated into Bam HUXho I digested pFastBacl (Life Technologies). 

The E. californica cytochrome P-450 reductase clone pGEM-T/EcaCPRII was 
digested with the restriction endonucleases Sma I and Not I and the 2251 bp fragment 
10 was ligated into pFastBacl that had been digested first with Bam HI, then with Pfu DNA 
polymerase to produce blunt ends, and finally with Not I. pFastBac/PsoCPRII and 
pFastBac/EcaCPRII were transposed into baculovirus DNA and then transfected into 
Spodoptera frugiperda Sf9 cells according to the manufacturer's instructions. The insect 
cells were propagated and the recombinant virus was amplified according to (7). 
1 5 Isolation of insect cell microsomes was performed as published (7) and the cytochrome c 
reducing activity measured as for the yeast expression. 

Heterologous expression in insect cell culture (S. frugiperda Sf9 cells) of 
pFastBac/PsoCPRII produced 4-fold more activity than the insect cell endogenous 
reductase, representing 40% of the activity produced by pYES2/PsoCPRII in yeast (Fig. 
20 7B). Expression of pFastBac/PsoCPRI, the construction containing a 27 bp long 5'- 
noncoding region, resulted in no measurable enzyme activity above that from the 
endogenous insect cell reductase. 

. Expression of pFastBac/EcaCPRII in insect cell culture produced a 10-fold 
increase in reductase activity (Fig 7D). The overexpression in insect cell culture was 
25 54% of that achieved in yeast. 

Example 5: Co-expression of cytochrome P-450 reductase and berbamunine 
synthase in Sf9 cells. 

To test for the possible effects of a plant cytochrome P-450 reductase as opposed 
to either yeast or insect cell reductase, several coexpressions were undertaken. 
30 Recombinant baculovirus containing either the P. somniferum or E. californica 

cytochrome P-450 cDNA was added simultaneously to S. frugiperda Sf9 cells (Gibco- 
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BRL) with recombinant virus containing the berbamunine synthase (CYP 80) cDNA (7). 
The oxidase virus was infected at a multiplicity of infection (MOI) of approximately 5 
and the amount of reductase virus varied from an MOI from 1-5. The infection were 
carried out as described in (25). 



phenol coupling enzyme of bisbenzylisoquinoline alkaloid biosynthesis from B. 
stolonifera, berbamunine synthase (7). In the presence of equimolar concentrations of 
the two substrates (5)-N-methylcoclaurine and (i?)-A r -methylcoclaurine, the native 
enzyme produces two products in a ratio of 90:10 (berbamunine (R,S- 

10 dimer):guattegaumerine (i?,i?-dimer)) that correspond to the ratio of these two alkaloids 
found in the Berberis plant (6). Berbamunine synthase expressed in insect cells 
produced, however, the dimers R,SJl,R in a ratio of 15:85 (7). Co-infection of insect cell 
culture with two baculovirus preparations, one containing berbamunine synthase cDNA 
and the other containing E. californica cytochrome P-450 reductase, in varying ratios 

15 resulted in a shift in the ratio of the enzymatic products formed as follows: 
oxidase:reductase (5:1), R f S:R t R (29:71); oxidase:reductase (1:1), R,S:R,R (35:65); 
oxidase:reductase (1:2), R.S:R,R (37:63). 

The isolation and functional expression of cDNAs encoding cytochrome P-450 
reductases from E. californica and P. somniferum described above were undertaken to 

20 develop suitable heterologous expression systems optimal for the active expression of 
select cytochrome P-450-dependent oxidases of alkaloid biosynthesis, thus providing a 
convenient test system. Initial characterization of the cytochrome P-450 reductase from 
P. somniferum indicated that with respect to molecular weight, K m and pH optimum, the 
reductase is similar to those characterized from other plant species (9,13,14). The 

25 purified reductase resolved into two closely migrating bands on SDS-PAGE, suggesting 
that isoforms are present in P. somniferum. This is similar to the finding that multiple 
reductase isoforms are present in Arabidopsis thaliana (16) and H. tuberosus. The 
presence of isoforms in P. somniferum was further supported by amino acid sequence 
analysis of the purified reductase as compared to the sequence identified through cDNA 

30 cloning. In addition, isolation of a cDNA encoding cytochrome P-450 reductase from E. 
californica indicated the presence of two isoforms in this plant species as well. The 



5 



The cytochrome P-450 oxidase that was used in these experiments was the C-0 
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presence of at least two genes in each genome was coroborated by genomic DNA gel 
blot analysis. 

The cDNA encoding one cytochrome P-450 reductase isoform from each P. 
somniferum and E. californica was functionally expressed in yeast in an autonomously 
5 replicating vector and in an integrative vector with transcription under the control of the 
GAL J gene promoter. These vector constructions resulted in a 6- to 15-fold increase in 
reductase activity as compared to the activity from the endogenous yeast reductase alone. 
Likewise, expression of the reductases in insect cell culture using a baculovirus 
expression vector produced a 4- to 10-fold increase in reductase activity. Improved 

10 heterologous expression was obtained when the 5*- noncoding sequences were 
completely removed from the cDNAs. 

Co-expression of the Eschscholzia reductase with the plant oxidase berbamunine 
synthase (7) in insect cell culture indicated that the amount of plant reductase present 
exerted an influence on the ratio of the products that were enzymatically formed. A first 

15 indication of this effect was shown by reconstitution of purified heterologously 
expressed berbamunine synthase reconstituted with Berberis reductase or with porcine 
reductase (7). Since it is difficult to standardize the lipids when reductase and oxidase 
are purified from microsomal membranes originating from different organisms, a co- 
infection of insect cells with reductase and oxidase is one method by which to avoid the 

20 varying effects of lipids. An increasing amount of plant reductase resulted in a shift in 
the ratio of products formed by berbamunine synthase from R,S:R,R in a ratio of 15:85 in 
the absence of Eschscholzia reductase to R,S:R t R (37:63) when a two-fold excess of 
baculovirus containing the Eschscholzia reductase was used for the co-infection. These 
results indicate that the cytochrome P-450 reductase may influence the binding of 

25 substrate to berbamunine synthase. Although the FMN, FAD and NADPH-binding 
domains of cytochrome P-450 reductase have been identified by sequence comparisons 
with well studied flavoproteins, less is known about the substrate binding sites (27,28). 
Interaction with the non-physiological substrate cytochrome c has been demonstrated by 
chemical cross-linking (29) and by site-directed mutagenesis (30) to involve an acidic 

30 region between amino acid residues 200-220 of rat cytochrome P-450 reductase, but an 
elucidation of the specific interaction between reductase and cytochrome P-450 has not 
yet been reported. - 
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Example 6: Transformation of poppy plant cells with nucleotide sequences 
encoding cytochrome P-450 reductase proteins. 

General protocols for transformation of Papaver somniferum cell suspensions 
have been published in the literature and would be known to skilled addressees in the 
5 field. 

The identification and cloning of genes for cytochrome P-450 reductase enzymes 
now provides means by which the pathway of alkaloid metabolism can be regulated, 
specifically by alleviating the rate limiting steps which rely on cytochrome P-450. This 
in turn provides means of obtaining poppy plants with increased yield of alkaloids. 

10 However, there will be instances where it may be preferrable to manipulate the 

alkaloid metabolism of a poppy plant by suppression of genes encoding the P-450 
reductases. The expression in the poppy of the cDNA encoding a P-450 reductase 
enzyme or part thereof, in an antisense orientation can be used to achieve this such that 
the expression directs the inhibition of the endogenous cytochrome P-450 reductase gene 

1 5 or homologues. In addition, the cDNA encoding the P-450 reductase enzyme or part 
thereof could be expressed in the sense orientation to direct the co-suppression of the 
endogenous cytochrome P-450 reductase gene or homologues. Furthermore, the cloned 
cDNA sequence can be used to design ribozyme sequences such as the hammerhead or 
hairpin ribozymes that can be used to suppress the target gene by inactivation of the 

20 endogenous cytochrome P-450 reductase gene mRNA. The genes encoding the sense, 
antisense or ribozymes can be delivered as transgenes stably integrated into the poppy 
genome or transiently in the form of a viral vector. 

Although the invention has been described with reference to specific embodiments, 
modifications that are within the knowledge of those skilled in the art are also 

25 contemplated as being within the scope of the present invention. 

DATED this 29th day of August, 1997 

JOHNSON & JOHNSON RESEARCH PTY LIMITED 

Attorney: IAN T. ERNST 
Fellow Institute of Patent Attorneys of Australia 
30 of SHELSTON WATERS 
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Peptide 1 KVTIFFGTQK 

Peptide 2 KWDLDDYAADDDEFEEK 

E 

Peptide 3 KWFTEVAK 
D 

Peptide 4 KWDEIIVEK 

Peptide 5 KYADLLNFPK 

Peptide 6 KAALHALAK 

Peptide 7 KD VHRTLHT I VQEQGS LDS S K 
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Arabidopsis tha liana ..350 GSPLES-AVPPPFPGPCT 
Catharanthus roseus ..389 GTPLAGSSLPPPFP-PCT 
Helianthus tuberosus . .331 GTPLGGPTLQPPFP - PCT 
Vigna radlata ..366 GTSLGG-SLLPPFPGPCS 

Vicia sativa ..367 GTSLGG- SLLPPFPGPCT 

* * **** ** 



LGTGLAR YADLLNP PRKS ALVAIiAAYATE P S EAE KLKHLTS PDGKD 
LRTALTRYADLLNTPKKSAIjIiAIiAAYASDPNEADRL^ PAGKD 
LRKALTNYADLLSSPKKSTLIiAIiAAHASDATKADRLQFLASREGKD 
LJlTAIiARYADIiLNPPRKAAIiLAIiATHASEPS - DERLKFLS S PQGKD 
VRTAIJVCYADIiLNPPRKAAIVALAAHASEPSEAERIiKFLSSPQGKD 
* ***** *.* .*** *. * ** *** 

EYSQWIVASQRSLLEVMAAFPSAKPPLGVFFAAIAPRLQPRYYS IS 
E YAQ S LVANQRS LLEVMAEFPS AKP PLGVFFAAI APRLQPRFYS I S 
EYAEWI VANQRSLLEVMEAFPSAKPPLGVFFAAIAPRLQPRYYS I S 
EYS KWWGSQRS LVEVMAEFPS AKPPIiGVFFJ\AIAPRLQPRYYS I S 
EYSKWWGSQRSLLEVMADFPSAKPPLGVFFAAIAPRLQPRYYS I S 
** * **** *** ********************** **** 



SCQDWAPSRVHVTSALVYGPTPTGRIHKGVCSTWMKNAVP 4 98. 
SSPRMAPSRIHVTCALVYEKTPGGRIHKGVCSTWMKNAIP 537. 
S S PKMVPNR I HVTCALVYEKTPGGR I HKG I CS TWMKNAVP 479. 
S S PRFAPQRVHVTCAIiVYGPTPTGR IHKGVCSTWMKNAI P 513. 
SSPRPAPQRVHVTCALVEGPTPTGRIHKGVCSTWMKSATP 515 . 

* * * *** *** ** ****** ****** * 



Figure 3. 
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1 50 

MEQTAVKVSL FDLFSSILNG KLDPSNFSSD SSAAILIENR EILMILTTAI 

MGSNNLANS I ESMLG.ISIG . . . . SEYISD P IFIMVTTVA 

* * ***** * ** 

51 100 

AVFIGCGFLY VWRRSSNKSS KIVETQKLIV EKEPE . PEVD DGKKKVTIFF 

SMLIGFGFFV CMK.SSSSQS KPIETYKPII DKEEEE I EVD PGKIKLTIFF 
** ** ** * * ** * * ** * *** ** * **** 

101 150 

GTQTGTAEGF AKALAEEAKA RYEKAIFKVI DLDDYGADDD EFEEKLKKET 

GTQTGTAEGF AKALAEEIKA KYKKAWKW DLDDYAAEDD QYEEKLKKES 

********** ******* ** * ** ** ***** * ** ******* 

151 200 

IALFFLATYG DGE PTDNAAR FYKWFTEGKE REMWLQNLQF GVFGLGNRQY 

LVFFMVATYG DGE PTDNAAR FYKWFTQEHE RGEWLQQLTY GVFGLGNRQY 

* **** ********** ****** * * *** * ********** 

201 250 

EHFNKVAKEV DEILTEQGGK RIVPVGLGDD DQCIEDDFTA WRELVWPELD 

EHFNKIAVDV DEQLGKQGAK RIVQVGLGDD DQCIEDDFTA WRELLWTELD 

***** * * ** * ** * *** ****** ********** **** * *** 

251 300 

QLLLDESDKT SVSTPYTAIV PEYRWFHDA TDASLQDKNW SNANGYTVYD 

QLLKDEDAAP SVATPYIATV PEYRWIHET TVAALDDKHI NTANGDVAFD 

*** ** ** *** * * ****** * * * * ** *** * 

301 350 

VQHPCRANW VKKELHTPVS DRSCIHLEFD I SGTGLTYET GDHVGVYSEN 

ILHPCRTIVA QQRELHKPKS DRSCIHLEFD ISGSSLTYET GDHVGVYAEN 

**** * *** * * ********** *** ***** ******* ** 

351 400 

CVEWEEAER LLGYSSDTVF SIHVDKEDGS PISGSALAPP FPTPCTLRTA 

CDETVEEAGK LLGQPLDLLF SIHTDKEDGS PQGSS . . LPP FPGPCTLRSA 

+ * **** *** * * +** ****** * * ** ** ***** * 

401 450 

LTRYADLLNS PKKAALHALA AYASDPKEAE RLRYLAS PAG KDEYAQWIVA 

LARYADLLNP PRKASLIALS AHAS VPS EAE RLRFLSSPLG KNEYSKWWG 

* ******* + ** * ** * ** * *** *** * ** * * ** + * 

451 500 

SQRSLLWMA EFPSAKAPIG VFFAAVAPRL LPRYYSISSS NRMVPSRIHV 

SQRSLLEIMA EFPSAKPPLG VFFAAVAPRL PPRYYSISSS PKFAPSRIHV 
****** ** ****** * * ********** ********* ****** 

501 550 

TCALVHEKTP AGRVHKGVCS TWMKNSVSLE ENHDCSSWAP IFVRQSNFKL 

TCALVYGQSP TGRFHRGVCS TWMKHAVPQD S WAP I FVRTSNFKL 

****** * *********** * ************ 

551 600 

PADSTVPIIM IGPGTGLAPF RGFMQERLAL KNSGVELGPA ILFFGCRNRQ 

PADPSTPIIM VGPGTGLAPF RGFLQERMAL KENGAQLGPA VLFFGCRNRN 

*** **** ********* *** *** ** * * **** ******** 

601 650 

MDYIYEEELN NFVKEGA I S E WVAFS REG A TKEYVQHKMA EKASYIWEMI 

MDFIYEDELN NFVERGVISE LVIAFSREGE KKEYVQHKMM EKATDVWNVI 
** *** *** *** * *** * ****** ******** *** * * 

651 700 

SQGAYLYVCG DAKGMARDVH RTLHTIAQEQ GSLDNSKTES LVKNLQMDGR 

SGDGYLYVCG DAKGMARDVH RTLHTIAQEQ GSMESSAAEA AVKKLQVEER 

* ****** ********** ********** ** * * ** ** * 

701 

YLRDVW 
YLRDVW 
****** 
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1/1 * 31/11 

CGG CAC GAG CTT GTT AGT ATC TTC TAG GGT TTG AAA AGA AGC ACA GGG AGA AGC AAA AGT 
RHEL.VS IF* GLKRSTGRSKS 
61/21 91/31 

CGA ATC TAC TTG AAA TAC ATT CGA TTG CTT CTC TCT GTT TAA GCT TCA GAG TCT CTG CTA 

RIYLKYIRLLLSV*ASESLL 
121/41 151/51 

ATT ATG GGT TCG AAT AAT TTA GCT AAT TCG ATT GAA TCG ATG TTA GGA ATA TCA ATA GGA 
IMGSNNLANS IESMLGI S IG 
181/61 211/71 

TCA GAA TAT ATT TCT GAC CCA ATT TTC ATT ATG GTC ACA ACT GTA GCT TCA ATG CTG ATT 
SEYISDPIFIMVTTVASMLI 
241/81 271/91 

GGA TTT GGT TTC TTC GCA TGT ATG AAA TCT TCG TCT TCT CAA TCA AAA CCT ATT GAA ACT 
GFGFFACMKSSSSQSKPIET 
301/101 331/111 

TAT AAA CCA ATA ATT GAT AAA GAA GAA GAG GAG ATT GAA GTT GAT CCT GGT AAA ATT AAG 
YKPI IDKEEEEIEVDPGKIK 
361/121 391/131 

CTC ACT ATA TTT TTT GGT ACT CAG ACT GGT ACT GCT GAA GGA TTT GCT AAG GCA TTG GCA 
Ij T I FFGTQTGTAEGF AKALA 
421/141 451/151 

GAA GAA ATT AAG GCA AAG TAC AAG AAA GCA GTT GTT AAA GTA GTT GAC CTG GAT GAC TAT 
EE I KAKYKKAVVKVVDLDDY 
481/161 511/171 

GCA GCC GAG GAT GAT CAA TAT GAA GAG AAA TTA AAG AAA GAG TCT TTG GTG TTT TTC ATG 
AAEDDQYEEKLKKESLVFFM 
541/181 571/191 

GTA GCC ACT TAT GGT GAT GGT GAG CCA ACT GAC AAT GCT GCG AGA TTT TAC AAA TGG TTC 
VATYGDGE PTDNAARFYKWF 
601/201 631/211 

ACT CAG GAA CAT GAA AGG GGA GAG TGG CTT CAG CAA CTA ACT TAT GGT GTT TTT GGT TTG 
TQEHERGEWLQQLTYGVFGL 
661/221 691/231 

GGT AAC CGT CAA TAC GAG CAT TTC AAC AAG ATC GCG GTA GAT GTG GAT GAG CAA CTC GGT 
GNRQYEHFNKIAVDVDEQLG 
721/241 751/251 

AAA CAA GGT GCA AAG CGC ATT GTT CAA GTG GGG CTC GGT GAC GAT GAT CAA TGC ATT GAA 
KQGAKR IVQVGLGDDDQCI E 
781/261 811/271 

GAT GAT TTT ACT GCT TGG CGA GAA TTG TTG TGG ACT GAA TTG GAT CAG TTG CTC AAA GAT 
DDFTAWRELLWTELDQLLKD 
841/281 871/291 

GAG GAT GCT GCT CCT TCA GTG GCT ACA CCG TAT ATT GCT ACT GTT CCT GAA TAC AGG GTA 
EDAAPSVATPYIATVPEYRV 
901/301 931/311 

GTG ATT CAC GAA ACT ACG GTC GCG GCT CTG GAT GAT AAA CAC ATA AAT ACT GCT AAC GGC 
VIHETTVAALDDKH INTANG 
961/321 991/331 

GAT GTT GCA TTT GAT ATT CTC CAT CCT TGC AGA ACC ATT GTT GCT CAA CAA AGA GAG CTC 
DVAFDILHPCRTIVAQQ .REL 
1021/341 1051/351 
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CAC AAA CCC AAG TCT GAT AGA TCC TGT ATA CAT CTG GAG TTC GAC ATA TCA GGC TCT TCC 

HKPKSDRSCIHLEFDISGSS 
1081/361 1111/371 

CTT ACA TAT GAG ACT GGA GAT CAT GTT GGT GTT TAT GCT GAG AAC TGC GAT GAA ACT GTC 
LTYETGDHVGVYAENCDETV 
1141/381 1171/391 

GAG GAA GCA GGG AAG CTG TTG GGT CAA CCC CTG GAT TTG CTG TTT TCA ATT CAC ACG GAT 

EEAGKLLGQPLDLLFSIHTD 
1201/401 1231/411 

AAA GAA GAC GGG TCA CCC CAG GGA AGC TCA TTA CCA CCT CCT TTC CCA GGT CCT TGC ACC 
KEDGSPQGSSLPPPFPGPCT 
1261/421 1291/431 

TTA CGA TCT GCC CTA GCA CGC TAT GCT GAT CTT TTG AAT CCT CCT AGA AAG GCT TCT CTG 
LR SALARYADLLNP PRKAS L 
1321/441 1351/451 

ATT GCT CTG TCC GCT CAT GCA TCT GTA CCC AGT GAA GCA GAG AGA TTG CGC TTT TTG TCA 
IALSAHASVPSEAERLRFLiS 
1381/461 1411/471 

TCA CCT CTG GGA AAG AAT GAG TAT TCA AAA TGG GTA GTT GGA AGT CAG AGG AGT CTT TTG 
S PLGKNEYSKWVVGSQRSLL 
1441/481 1471/491 

GAG ATC ATG GCC GAG TTT CCA TCA GCA AAA CCC CCT CTT GGT GTT TTC TTT GCT GCA GTA 
E IMAEFPSAKPPLGVFFAAV 
1501/501 1531/511 

GCC CCT CGC TTA CCG CCT CGA TAC TAT TCT ATC TCA TCC TCT CCT AAG TTT GCT CCC TCA 
APRLPPRYYSISSSPKFAPS 
1561/521 1591/531 

AGA ATT CAT GTG ACG TGT GCT TTA GTA TAT GGT CAA AGC CCT ACC GGA AGG GTT CAC CGA 
R I HVTCALVYGQS PTGRVHR 
1621/541 1651/551 

GGA GTG TGT TCG ACA TGG ATG AAG CAT GCA GTT CCT CAG GAT AGC TGG GCT CCT ATT TTT 
GVCSTWMKHAVPQDSWAPI F 
1681/561 1711/571 

GTT CGA ACG TCA AAC TTC AAG TTA CCA GCT GAC CCC TCA ACT CCA ATT ATC ATG GTG GGA 
VRTSNFKLPADPSTPI IMVG 
1741/581 1771/591 

CCT GGT ACA GGG TTA GCT CCT TTC AGA GGA TTT CTG CAG GAA AGA ATG GCC CTC AAG GAA 
PGTGLAP FRGFLQERMALKE 
1801/601 1831/611 

AAT GGT GCT CAA CTT GGC CCA GCA GTG CTC TTT TTC GGA TGT AGG AAT CGT AAT ATG GAC 
NGAQLGPAVLFFGCRNRNMD 
1861/621 1891/631 

TTC ATT TAT GAA GAC GAA CTA AAC AAC TTC GTG GAA CGA GGA GTA ATT TCG GAG CTA GTT 
F IYEDELNNFVERGVI SELV 
1921/641 1951/651 

ATT GCC TTT TCA CGT GAA GGG GAA AAG AAG GAA TAT GTT CAA CAT AAG ATG ATG GAG AAA 
IAFSREGEKKEYVQHKMMEK 
1981/661 2011/671 

GCA ACG GAT GTA TGG AAT GTG ATA TCA GGG GAC GGT TAT CTC TAT GTG TGT GGT GAT GCC 
ATDVWNVI SGDGYLYVCGDA 
2041/681 2071/691 
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AAG GGA ATG GCC AGA GAT GTC CAT CGC 

KGMARDVHR 

2101/701 

ATG GAA TCA TCT GCT GCC GAA GCT GCA 

MESSAAEAA 

2161/721 

AGA GAT GTC TGG TGA TCG AAT GTA GCT 

RDVW*SNVA 

2221/741 

TGG TTT CTA TTA TAT TAT TGA TCC TCC 

WFLLYY*SS 

2281/761 

GAT TCT TCC TCC AGT GGT TCC AAA TCG 

DSSSSGSKS 

2341/781 

ACT ACA TGA GAA GCA AAC ATC GAA TAC 
TT* EANIEY 
2401/801 

GAA CAA TGT TAC AGG CAA AAC TGT GTT 

EQCYRQNCV 

2461/821 

CAA CAC TGA AAC AGT ATT AGC TAT ACC 

QH*NSISYT 

2521/841 

GAT CTT CTC TTT GGA TTG ATT ACT GTA 

DLLFGLITV 

2581/861 

TTC TTG TTT TCT TAT GGC TAC CGA GAG 

FLFSYGYRE 

2641/881 

AAA AAA AAA A 

K K K 
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ACG TTG CAT ACC ATT GCC CAA GAA CAG GGA CCC 
TLHTIAQEQGP 
2131/711 

GTA AAG AAA CTC CAA GTT GAA GAA CGA TAT CTA 
VKKLQVEERYL 
2191/731 

TGC CAA GTC CCC TTT TCT TGG CTG GTC TGT TTA 
CQVPFSWLVCL 
2251/751 

TCT GAA AAT CCC AAG CAC TTC CAG ACA TCC CTC 
SENPKHFQTSL 
2311/771 

AAG CTC GGT ATA ATT GAG AGC AGT GCA ATT GTG 
KLGIIESSAIV 
2371/791 

CAT AGA ATT AGA AAG ATC AAA ATT CTC TTA TCA 
HRIRKIKILLS 
2431/811 

TGC TTA ATA TAA ATT TCA CAC CAT GGG TGT GGA 
CLI*ISHHGCG 
2491/831 

AAC AAA GTT ATG CAA GGA AAC ACA AAC TAG TTA 
NKVMQGNTN* L 
2551/851 

AGT TCT AAC CAG ATG ATA GAT TGT ACT TAA AGA 
SSNQMIDCT* R 
2611/871 

GAG TAT ATT AAT GCA TTT AGA GTT TTG AGA AAA 
EYINAFRVLRK 



Figure 9a (cont.). 




1/1 

TTC TTC TTC CAA TCG CAT TCG AGA AAA 

FFFQSHSRK 

61/21 

TCA GAA ACA CTG AAG CTC ATC ATC ATC 

SETLKLIII 

121/41 

AAA ACT ATG GAA CAA ACT GCG GTT AAA 

KTMEQTAVK 

181/61 

AAT GGA AAG TTG GAT CCG TCG AAC TTT 

NGKLDPSNF 

241/81 

AAT CGT GAG ATT TTA ATG ATC TTA ACA 

NREILMILT 

301/101 

CTC TAC GTT TGG AGA AGA TCT TCA AAT 

LYVWRRSSN 

361/121 

ATC GTT GAA AAG GAA CCA GAA CCT GAA 
IVEKEPEPE 
421/141 < 
TTT GGT ACT CAA ACT GGT ACA GCT GAA 
FGTQTGTAE 
481/161 

GCA AGA TAT GAA AAG GCA ATC TTT AAA 

ARYEKAIFK 

541/181 

GAT GAA TTC GAA GAG AAA TTG AAA AAG 

DEFEEKLKK 

601/201 

GGA GAT GGT GAA CCT ACA GAT AAT GCT 

GDGEPTDNA 

661/221 

AGG GAA ATG TGG CTC CAG AAT CTT CAA 

REMWLQNLQ 

721/241 

GAG CAT TTC AAT AAG GTG GCA AAG GAG 

EHFNKVAKE 

781/261 

CGT ATT GTT CCC GTG GGT CTA GGA GAT 

RIVPVGLGD 

841/281 

TGG CGG GAG TTG GTA TGG CCT GAA TTG 

WRELVWPEL 

901/301 

TCT GTT TCT ACT CCT TAC ACT GCC ATC 

SVSTPYTAI 

961/321 

ACT GAT GCA TCA CTA CAA GAC AAA AAC 

TDASLQDKN 

1021/341 




14/20 

31/11 

TTC AAT CAT CTT CAA CTT CAG GAA GAA GAA TCA 
FN HLQL QEEES 
91/31 

CTT GAA ACT TAT CGT CTT TGT TTG ACC TTT TGA 
LETYRLCLTF* 
151/51 

GTC TCT TTG TTT GAT CTA TTT TCT TCG ATA CTT 
VSLFDLFSSIL 
211/71 

TCT TCA GAT TCA AGT GCT GCT ATT TTG ATT GAA 
SSDSSAAILIE 
271/91 

ACT GCT ATT GCT GTT TTT ATC GGT TGT GGT TTT 
TAIAVFIGCGF 
331/111 

AAG TCG AGT AAA ATT GTT GAA ACT CAG AAA TTG 
KSSKIVETQKL 
391/131 

GTT GAT GAT GGA AAG AAG AAG GTT ACT ATC TTC 
VDD GKK KVTI F 
451/151 

GGA TTC GCA AAG GCA CTT GCT GAA GAA GCA AAA 
GFAKALAEEAK 
511/171 

GTG ATT GAT CTG GAT GAT TAC GGA GCA GAT GAT 
VIDLDDYGADD 
571/191 

GAA ACT ATA GCT CTT TTC TTT TTG GCT ACC TAT 
ETIALFFLATY 
631/211 

GCA AGA TTT TAT AAA TGG TTC ACA GAG GGA GAG 
ARFYKWFTEGE 
691/231 

TTT GGT GTC TTC GGT CTA GGC AAT AGA CAG TAT 
FGVFGLGNRQY 
751/251 

GTG GAC GAG ATA CTC ACT GAA CAG GGT GGG AAG 
VDEILTEQGGK 
811/271 

GAT GAT CAA TGC ATA GAA GAT GAT TTC ACT GCG 
DDQCIEDDFTA 
871/291 

GAT CAG TTG CTC CTT GAT GAA AGT GAT AAA ACA 
DQLLLDESDKT 
931/311 

GTA CCA GAA TAC AGG GTA GTA TTC CAT GAT GCT 
VPEYRVVFHDA 
991/331 

TGG AGC AAT GCA AAT GGC TAC ACT GTT TAC GAC 
WSNANGYTVYD 
1051/351 
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GTT CAA CAC CCA TGC AGA GCC AAT GTC GTT GTA AAG AAG GAG CTT CAC ACT CCA GTA TCT 

VQHPCRANVVVKKELHTPVS 
1081/361 1111/371 

GAT CGT TCT TGT ATT CAT CTG GAA TTT GAC ATT TCT GGC ACT GGG CTC ACG TAT GAA ACA 

DRSCIHLEFDISGTGLTYET 
1141/381 1171/391 

GGA GAC CAT GTC GGT GTT TAC TCT GAG AAT TGT GTT GAA GTT GTC GAG GAA GCA GAG AGG 
GDHVGVYSENCVEVVEEAER 
1201/401 1231/411 

CTA TTG GGT TAC TCA TCA GAC ACC GTT TTT TCA ATC CAT GTC GAT AAA GAG GAC GGC TCA 
L L G Y S SDTVFS IHVDKEDGS 
1261/421 1291/431 

CCC ATT AGT GGA AGC GCT CTA GCT CCT CCT TTT CCA ACT CCC TGC ACT CTA AGA ACA GCA 
PISGSALAPPFPTPCTLRTA 
1321/441 1351/451 

CTA ACA CGA TAC GCT GAT CTG TTG AAT TCT CCC AAG AAG GCT GCT CTG CAT GCT TTG GCT 
LTRYADLiLNS PKKAALHALA 
1381/461 1411/471 

GCT TAT GCA TCC GAT CCA AAG GAA GCG GAG CGA CTA AGG TAT CTT GCG TCT CCT GCT GGG 
AYASDPKEAERLRYLASPAG 
1441/481 1471/491 

AAG GAC GAA TAC GCC GAG TGG ATA GTA GCT AGT GAG AGA AGT CTG CTA GTG GTC ATG GCT 
KDEYAQWIVASQRSLLVVMA 
1501/501 1531/511 

GAA TTC CCA TCA GCA AAG GCT CCA ATT GGG GTT TTC TTT GCA GCA GTA GCT CCT CGC TTG 
EFPSAKAP I GVFFAAVAPRL 
1561/521 1591/531 

CTG CCA AGA TAC TAT TCT ATT TCA TCT TCC AAT AGG ATG GTA CCA TCT AGG ATT CAT GTC 
LPRYYSISSSNRMVPSRIHV 
1621/541 1651/551 

ACA TGT GCA TTG GTG CAT GAA AAA ACA CCG GCA GGT CGG GTT CAC AAA GGA GTG TGT TCA 
TCALVHEKTPAGRVHKGVCS 
1681/561 1711/571 

ACC TGG ATG AAG AAT TCT GTG TCT TTG GAA GAA AAC CAT GAT TGC AGC AGC TGG GCA CCA 
TWMKNSVSLEENHDCSSWAP 
1741/581 1771/591 

ATC TTT GTC AGG CAA TCC AAC TTC AAA CTT CCT GCT GAT TCT ACA GTA CCA ATT ATA ATG 
I FVRQSNFKLPADSTVPI IM 
1801/601 1831/611 

ATT GGT CCT GGG ACT GGA TTA GCT CCC TTT AGG GGA TTC ATG CAG GAG CGA TTA GCT CTG 
IGPGTGLAPFRGFMQERLAL 
1861/621 1891/631 

AAG AAT TCT GGT GTA GAA TTG GGA CCC GCT ATC CTC TTC TTT GGA TGC AGA AAC AGA CAG 
KNSGVELGPAILFFGCRNRQ 
1921/641 1951/651 

ATG GAT TAC ATA TAT GAA GAG GAG CTA AAC AAC TTT GTG AAA GAG GGA GCT ATC TCC GAA 
MDY I Y E E E LNNFVKEGA I S E 
1981/661 2011/671 

GTT GTT GTT GCT TTC TCA CGT GAG GGA GCT ACC AAG GAA TAC GTA CAA CAT AAA ATG GCG 
VVVAFS REG ATKEYVQHKMA 
2041/681 2071/691 
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GAG AAG GCT TCC TAC ATC TGG GAA ATG ATC TCT CAA GGT GCT TAT CTT TAT GTA TGT GGT 

EKASYIWEMISQGAYLYVCG 
2101/701 2131/711 

GAT GCC AAG GGC ATG GCT AGA GAC GTA CAT CGA ACT CTC CAC ACC ATT GCC CAG GAA CAG 

dakgmardvhrtlht iaqeq 

2161/721 2191/731 

GGA TCT TTG GAC AAC TCG AAG ACC GAA AGC TTG GTG AAG AAT CTA CAG ATG GAT GGA AGG 

gsldnskteslvknlqmdgr 

2221/741 / 2251/751 

TAT CTA CGT GAT GTG TGG TGA TTG ATT TTT TCA GCA CGG TTA CAA TCT AGC TTC ATC AAA 

YLRDVW^LIFSARL' QSSFIK 
2281/761 ( 2311/771 

GAA CGC GCT TGA GAA GCA TAA ATC TTA GTT GCA GAG ATG TTG ATT TCA GAA GAA ATG CTT 
E R A * E A * ILVAEMLI SEEML 
2341/781 2371/791 

TAT ATA CTT GAG GTA GCG GAC ATT AAT CCT TTT CTC TCT CTC TAA ACT GTT AAT CCT GTA 

YILEVADINPFIiSL*TVNPV 
2401/801 2431/811 

AAA AAG GGA TTG CTG TTT GTG TTT GCT CGC AAT CAA TTA AGT TAT ATT CTT TGG TCT ATG 

KKGLLFVFARNQLSYILWSM 
2461/821 2491/831 

GCA TTC GTT AGA CAA ATA TAT TAA CGA GTT TGT CCG TTA TAT ATG ACA TAT GAA ACA AAA 
A F V R Q I Y * RVCPLYMTYETK 
2521/841 2551/851 
GAA CTT CTG TTT GGA GGA AGA GAA AAA AAA AAA AAA AA 
ELLFGQREK.KK>K 
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1 AAGCTTCAGAGTCTCTGCTAATT ATG GGT TCG AAT AAT 
1 M G S N N 

66 GGA ATA TCA ATA GGA TCA GAA TAT ATT TCT GAC 
15GISIGSEYISD 

126 GCT TCA ATG CTG ATT GGA TTT GGT TTC TTC GCA 
35 AS ML I G FG F FA 

186 AAA CCT ATT GAA ACT TAT AAA CCA ATA ATT GAT 
55KPIETYKPIID 

246 CCT GGT AAA ATT AAG CTC ACT ATA TTT TTT GGT 
75PGKI KLTI FFG 

3 06 GCT AAG GCA TTG GCA GAA GAA ATT AAG GCA AAG 
95AKALAEE I KAK 

366 GAC CTG GAT GAC TAT GCA GCC GAG GAT GAT CAA 
115 DLDDYAAEDDQ 

4 26 TTG GTG TTT TTC ATG GTA GCC ACT TAT GGT GAT 
135 LVFFMVATYGD 

486 TTT TAC AAA TGG TTC ACT CAG GAA CAT GAA AGG 
155 FYKWFTQEHER 

546 GGT GTT TTT GGT TTG GGT AAC CGT CAA TAC GAG 
175 GVFGLGNRQYE 

606 GAT GAG CAA CTC GGT AAA CAA GGT GCA AAG CGC 
195 DEQLGKQGAKR 

666 GAT CAA TGC ATT GAA GAT GAT TTT ACT GCT TGG 
215 DQCI EDDFTAW 

726 CAG TTG CTC AAA GAT GAG GAT GCT GCT CCT TCA 
235 QLLKDEDAAPS 

786 CCT GAA TAC AGG GTA GTG ATT CAC GAA ACT ACG 
255 PEYRVV I HETT 

846 AAT ACT GCT AAC GGC GAT GTT GCA TTT GAT ATT 
"275 NTANGDVAFDI 

906 CAA CAA- AGA GAG CTC CAC AAA CCC AAG TCT GAT 
295 Q Q R E L H K P K S D 

966 ATA TCA GGC TCT TCC CTT ACA TAT GAG ACT GGA 
315 I-SGSSLTYETG 

102 6 TGC GAT GAA ACT GTC GAG GAA GCA GGG AAG CTG 
335 C P E T V E E A G K L 

1086 TCA ATT CAC ACG GAT AAA GAA GAC GGG TCA CCC 
355 S I. H T D K E D G S P 




TTA GCT AAT TCG ATT GAA TCG ATG TTA 65 

L. ANSIES ML 14 

CCA ATT TTC ATT ATG GTC ACA ACT GTA 125 

PIFIMVTTV 34 

TGT ATG AAA TCT TCG TCT TCT CAA TCA 185 

CMKSSSSQS 54 

AAA GAA GAA GAG GAG ATT GAA GTT GAT 245 

KEEEEIEVD 74 

ACT CAG ACT GGT ACT GCT GAA GGA TTT 305 

TQTGTAEGF 94 

TAC AAG AAA GCA GTT GTT AAA GTA GTT 365 

YKKAVVKVV 114 

TAT GAA GAG AAA TTA AAG AAA GAG TCT 425 

YEEKLKKES 134 

GGT GAG CCA ACT GAC AAT GCT GCG AGA 485 

GEPTDNAAR 154 

GGA GAG TGG CTT CAG CAA CTA ACT TAT 545 

GEWLQQLTY 174 

CAT TTC AAC AAG ATC GCG GTA GAT GTG 605 

HFNKIAVDV 194 

ATT GTT CAA GTG GGG CTC GGT GAC GAT 665 

IVQVGLGDD 214 

CGA GAA TTG TTG TGG ACT GAA TTG GAT 725 

RELLWTELD 234 

GTG GCT ACA CCG TAT ATT GCT ACT GTT 7 85 

VATPYIATV 254 

GTC GCG GCT CTG GAT GAT AAA CAC ATA 845 

VAALDDKH I 274 

CTC CAT CCT TGC AGA ACC ATT GTT GCT 905 

LHPCRTIVA 294 

AGA TCC TGT ATA CAT CTG GAG TTC GAC 965 

RSCIHLEFD 314 

GAT CAT GTT GGT GTT TAT GCT GAG AAC 1025 

DHVGVYAEN 334 

TTG GGT CAA CCC CTG GAT TTG CTG TTT 1085 

LGQPLDLLF 354 

CAG GGA AGC TCA TTA CCA CCT CCT TTC 1145 

Q G S S L P P P F 374. 
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1146 CCA GGT CCT TGC ACC TTA CGA TCT GCC CTA GCA CGC TAT GCT GAT CTT TTG AAT CCT CCT 1205 

375 PGPCTLRSALARYADLLNPP 394 

1206 AGA AAG GCT TCT CTG ATT GCT CTG TCC GCT CAT GCA TCT GTA CCC AGT GAA GCA GAG AGA 1265 

395 RKASLIALSAHASVPSEAER 414 

1266 TTG CGC TTT TTG TCA TCA CCT CTG GGA AAG AAT GAG TAT TCA AAA TGG GTA GTT GGA AGT 1325 

415LRFLSSPLGKNEYSKWVVGS 434 

1326 CAG AGG AGT CTT TTG GAG ATC ATG GCC GAG TTT CCA TCA GCA AAA CCC CCT CTT GGT GTT 1385 

SsQRSLLEIMAEFPSAKPPLGV 454 

1386 TTC TTT GCT GCA GTA GCC CCT CGC TTA CCG CCT CGA TAC TAT TCT ATC TCA TCC TCT CCT 1445 

455 FFAAVAPRLPPRYYSISSSP 474 

1446 AAG TTT GCT CCC TCA AGA ATT CAT GTG ACG TGT GCT TTA GTA TAT GGT CAA AGC CCT ACC 1505 

475 KFAPSRIHVTCALVYGQSPT 494 

1506 GGA AGG GTT CAC CGA GGA GTG TGT TCG ACA TGG ATG AAG CAT GCA GTT CCT CAG GAT AGC 1565 

495 GRVHRGVCSTWMKHAVPQDS 514 

i?66 TGG GCT CCT ATT TTT GTT CGA ACG TCA AAC TTC AAG TTA CCA GCT GAC CCC TCA ACT CCA 1625 

515WAPIFVR TS NF KLPADPS TP 534 

1626 ATT ATC ATG GTG GGA CCT GGT ACA GGG TTA GCT CCT TTC AGA GGA TTT CTG CAG GAA AGA 1685 

535 T IMVGPGTGLAPFRGFLQER 554 

16 86 ATG GCC CTC AAG GAA AAT GGT GCT CAA CTT GGC CCA GCA GTG CTC TTT TTC GGA TGT AGG 1745 

555 MALKENGAQLGPAVLFFGCR 574 

1746 AAT CGT AAT ATG GAC TTC ATT TAT GAA GAC GAA CTA AAC AAC TTC GTG GAA CGA GGA GTA 1805 

^g75 NRNMDFIYEDELNNFVERGV 594 

1806 ATT TCG GAG CTA GTT ATT GCC TTT TCA CGT GAA GGG GAA AAG AAG GAA TAT GTT CAA CAT 1865 

595 ISELVIAFSREGEKKEYVQH 614 

j 

1866 AAG ATG ATG GAG AAA GCA ACG GAT GTA TGG AAT GTG ATA TCA GGG GAC GGT TAT CTC TAT 1925 

615 KMME KATDVWNVI SGDGYLY 634 

1926 GTG TGT GGT GAT GCC AAG GGA ATG GCC AGA GAT GTC CAT CGC ACG TTG CAT ACC ATT GCC 1985 

635 VCGDAKGMARDVHRTLHTIA 654 

1986 CAA GAA CAG GGA CCC ATG GAA TCA TCT GCT GCC GAA GCT GCA GTA AAG AAA CTC CAA GTT 2045 

2C^ 55 qeqgPMESSAAEAAVKKLQV 674 

2046 GAA GAA 'CGA TAT CTA AGA GAT GTC TGG TGA TCGA ATG TAG CTTGCCAAtcactag 2100 

675 EERYLR. DVw/* M* 2 
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1 tgcagcccgggggatCCgccCT ATG GAA CAA ACT GCG GTT AAA GTC TCT TTG TTT GAT CTA TTT 64 

1 MEQTAVKVSLFDLF 14 

65 TCT TCG ATA CTT AAT GGA AAG TTG GAT CCG TCG AAC TTT TCT TCA GAT TCA AGT GCT GCT 124 

15SSILNGKLDPSNFSSDSSAA 34 

125 ATT TTG ATT GAA AAT CGT GAG ATT TTA ATG ATC TTA ACA ACT GCT ATT GCT GTT TTT ATC 184 

35 I L I ENRE I LMILTTAIAVF I 54 

185 GGT TGT GGT TTT CTC TAC GTT TGG AGA AGA TCT TCA AAT AAG TCG AGT AAA ATT GTT GAA 244 

55 G CG FLYVWRRSSNKS S K I VE 74 

245 ACT CAG AAA TTG ATC GTT GAA AAG GAA CCA GAA CCT GAA GTT GAT GAT GGA AAG AAG AAG 304 

75TQKLIVEKEPEPEVDDGKKK 94 

305 GTT ACT ATC TTC TTT GGT ACT CAA ACT GGT ACA GCT GAA GGA TTC GGA AAG GCA CTT GCT 364 

95 V T I FFGTQTGTAEGFAKALA 114 

365 GAA GAA GCA AAA GCA AGA TAT GAA AAG GCA ATC TTT AAA GTG ATT GAT CTG GAT GAT TAC 424 

115 EEAKARYE KAIF. KVIDLDDY 134 

425 GGA GCA GAT GAT GAT GAA TTC GAA GAG AAA TTG AAA AAG GAA ACT ATA GCT CTT TTC TTT 484 

135 GADDDEFEEKLK. KETIALFF 154 

485 TTG GCT ACC TAT GGA GAT GGT GAA CCT ACA GAT AAT GCT GCA AGA TTT TAT AAA TGG TTC 544 

155 LATYGDGE PTDNAARFYKWF 174 

545 ACA GAG GGA GAG AGG GAA ATG TGG CTC CAG AAT CTT CAA TTT GGT GTC TTC GGT CTA GGC 604 

175 TEGEREMWLQNLQFGVFGLG 194 

605 AAT AGA CAG TAT GAG CAT TTC AAT AAG GTG GCA AAG GAG GTG GAC GAG ATA CTC ACT GAA 664 

195 NRQYEH FNKVAKEVDE I LTE 214 

665 CAG GGT GGG AAG CGT ATT GTT CCC GTG GGT CTA GGA GAT GAT GAT CAA TGC ATA GAA GAT 724 

215 QGGKRIVPVGLGDDDQCI ED 234 

72 5 GAT TTC ACT GCG TGG CGG GAG TTG GTA TGG CCT GAA TTG GAT CAG TTG CTC CTT GAT GAA 784 

235 DFTAWRELVWPELDQLLLDE 254 

785 AGT GAT AAA ACA TCT GTT TCT ACT CCT TAC ACT GCC ATC GTA CCA GAA TAC AGG GTA GTA 844 

255 SDKTSVSTPYTA.IVPEYRVV 274 

84 5 TTC CAT GAT GCT ACT GAT GCA TCA CTA CAA GAC AAA AAC TGG AGC AAT GCA AAT GGC TAC 904 

275 FHDATDAS L QDKNWSNANGY 294 

905 ACT GTT TAC GAC GTT CAA CAC CCA TGC AGA GCC AAT GTC GTT GTA AAG AAG GAG CTT CAC 964 

295 TVY DVQH P CRAN VVVKKE LH 314 

965 ACT CCA GTA TCT GAT CGT TCT TGT ATT CAT CTG GAA TTT GAC ATT TCT GGC ACT GGG CTC 1024 

315 TPVSDRSCIHLEFDISGTGL 334 

102 5 ACG TAT GAA ACA GGA GAC CAT GTC GGT , GTT TAC TCT GAG AAT TGT GTT GAA GTT GTC GAG 1084 

335 T Y E T G D H V G >V Y S E" N C V E V V E 354 

1085 GAA GCA GAG AGG CTA TTG GGT TAC TCA TCA GAC ACC GTT -TTT TCA ATC- CAT GTC GAT AAA 1144 

355 EAERLLGYS 7 . SDTV.FS IHVDK 374 
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1145 GAG GAC GGC TCA CCC ATT AGT GGA AGC GCT CTA GCT CCT COT TTT CCA ACT CCC TGC ACT 1204 

375 EDGSPISGSALAPPFPTPCT 394 

1205 CTA AGA ACA GCA CTA ACA CGA TAC GCT GAT CTG TTG AAT TCT CCC AAG AAG GCT GCT CTG 1264 

395 LRTALTRYADLLNSPKKAAL 414 

1265 CAT GCT TTG GCT GCT TAT GCA TCC GAT CCA AAG GAA GCG GAG CGA CTA AGG TAT CTT GCG 1324 

415 HALAAYASDPKEAERLRYLA 434 

1325 TCT CCT GCT GGG AAG GAC GAA TAC GCC CAG TGG ATA GTA GCT AGT CAG AGA AGT CTG CTA 1384 

435 SPAGKDEYAQWIVASQRSLL 454 

1385 GTG GTC ATG GCT GAA TTC CCA TCA GCA AAG GCT CCA ATT GGG GTT TTC TTT GCA GCA GTA 1444 

455 VVMAEFPSAKAP IGVFFAAV 474 

144 5 GCT CCT CGC TTG CTG CCA AGA TAC TAT TCT ATT TCA TCT TCC AAT AGG ATG GTA CCA TCT 1504 

475 APRLLPRYYSISSSNRMVPS 494 

1505 AGG ATT CAT GTC ACA TGT GCA TTG GTG CAT GAA AAA ACA CCG GCA GGT CGG GTT CAC AAA 1564 

495 R I HVTCALVHEKTPAGRVHK 514 

1565 GGA GTG TGT TCA ACC TGG ATG AAG AAT TCT GTG TCT TTG GAA GAA AAC CAT GAT TGC AGC 1624 

515 GVCSTWMKNSVSLEENHDCS 534 

1625 AGC TGG GCA CCA ATC TTT GTC AGG CAA TCC AAC TTC AAA CTT CCT GCT GAT TCT ACA GTA 1684 

535 SWAPIFVRQSNFKLPADSTV 554 



1685 CCA ATT ATA ATG ATT GGT CCT GGG ACT GGA 
555 PIIMIGPGTG 

1745 CGA TTA GCT CTG AAG AAT TCT GGT GTA GAA 
575 RLALKNSGVE 

1805 AGA AAC AGA CAG ATG GAT TAC ATA TAT GAA 
595 RNRQMDYIYE 

18 65 GCT ATC TCC GAA GTT GTT GTT GCT TTC TCA 
615 AISEVVVAFS 

1925 CAT AAA ATG GCG GAG AAG GCT TCC TAC ATC 
635 HKMAEKASYI 

19 85 TAT GTA TGT GGT GAT GCC AAG GGC ATG GCT 
655 YVCGDAKGMA 

2045 GCC CAG GAA CAG GGA TCT TTG GAC AAC TCG 
675 AQ-EQGSLDNS 

2105 ATG GAT GGA AGG TAT CTA CGT GAT GTG TGG 
695 MDGRYLRDVW 



TTA GCT CCC TTT AGG GGA TTC ATG CAG GAG 1744 

LAP FRGFMQE 574 

TTG GGA CCC GCT ATC CTC TTC TTT GGA TGC 1804 

LGPAILFFGC? 594 

GAG GAG CTA AAC AAC TTT GTG AAA GAG GGA 1864 

EELNNFVKEG 614 

CGT GAG GGA GCT ACC AAG GAA TAC GTA CAA 1924 

REGATKEYVQ 634 

TGG GAA ATG ATC TCT CAA GGT GCT TAT CTT 1984 

WEMISQGAYL 654 

AGA GAC GTA CAT CGA ACT CTC CAC ACC ATT 204 4 

RDVHRTLHT I 674 

AAG ACC GAA AGC TTG GTG AAG AAT CTA CAG 2104 

KTESLVKNLQ 694 

TGA TTGgggctagagcggcc 2154 

* 705 
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